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Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score 
Sequence : 



October 23, 2003, 18:15:42 ; Search time 4096 Seconds 

(without alignments) 
1290b. 819 Million cell updates/ 



US-09-864-464-9 
2175 

1 actt gagcctccacct t etc 



aattggtgct t t t ga tagee 217b 



Scoring table: IDENTITY NUC 

Gapop 10.0 , Gapext 1.0 



Sea rched : 



22781392 seqs, 12152238056 residues 



Total number of hits satisfying chosen parameters 



45562784 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : 



EST 
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12 
13 
14 
15 
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23 
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25 
26 

2 7 



em_estba : * 
em_esthum: * 
em_es tin : * 
em_cstmu : * 
em_os t ov : * 
em_estpl : * 
em_estro : * 
em _ht c : * 
gb_est 1 : * 
gb_est2 : * 
gb_htc : * 
gb__est 3 : * 
gb_est4 : * 
gb_est5 : * 
em_est f un : 
em _estom : * 
em_ gss_hum 
em_gss i nv 
em_gss_pln 
em__gss_vrt 
em gss _f un 
em_gss_mam 
em_gs s_mus 
em_gss pro 
em_gss_rod 
em_gss_phg 
em gss vrl 



* 
* 

t 
t 



28: gb_gssl:* 
29: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Cornpugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 



October 23, 2003, 16:56:57 ; Search time 7708 Seconds 

{without alignments) 
11543.648 Million cell updates/ 



US-09-864-464-9 
2175 

1 acttgagect ccacct tctc 



aa t t gg tgettttgatagee 2175 



Scoring table: IDENTITY NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length : 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl : * 
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gb_ba : * 
gb_htg : * 
gb_in : * 
gb_om: * 
gb_ ov : * 
gb_pat : * 
gb_ph : * 
gb pi : * 
gb_pr : * 
gb_ro : * 
gb_sts : * 
gb sy:* 
gb_ un : * 
gb_vi : * 
em_ba : * 
em_f un : * 
em hum : * 
em_in : * 
em_mu : * 
em_om : * 
em_or : * 
em_ov : * 
em_pat : * 
em_ph : * 
em_pl : * 
em_ro : * 
em s t s : * 
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: em 


htg other:* 
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: em 


htg mus : * 


34 . 


: em 


htg pin : * 
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: em 


htg mam : * 


37 


: em 


htg vrt : * 


38 


: em 


sy : * 
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htgo hum: * 


40 


: em 


htgo mus : * 


41 


: em 


htgo other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID Description 



1 

1 


z 1 / 5 


100 


. 0 


z 1 / 5 


6 


AR1 64421 


AR164421 Sequence 


2 


897. 4 


41 


.3 


1938 


8 


BT002123 


BT002123 Arabidops 


3 


897. 4 


41 


.3 


2083 


8 


AY099551 


AY099551 Arabidops 


4 


897. 4 


41 


. 3 


2193 


8 


AF125574 


AF125574 Arabidops 


5 


886.4 


40 


. 8 


2143 


6 


AR164423 


AR164 4 23 Sequence 


6 


745. 6 


34 


. 3 


2075 


8 


LETjYSRSGN 


X94451 L.esculentu 


7 


539. 4 


24 


. 8 


1970 


10 


CLLYTRSY 


Z317H C.longicaud 


8 


539. 4 


24 


.8 


1997 


9 


BC004132 


BC004132 Homo sapi 


9 


539.2 


24 


. 8 


2000 


9 


AF285758 


AF285758 Homo sapi 


10 


538 


24 


.7 


1970 


9 


IIUM0RFKG1S 


D31890 Homo sapien 


11 


538 


24 


.7 


1997 


6 


AX214528 


AX214528 Sequence 


12 


538 


24 


.7 


1997 


9 


D32053 


D32053 Homo sapien 


13 


537.8 


24 


. 7 


2023 


10 


BC036289 


BC036289 Mus muscu 


14 


537 . 8 


24 


. 7 


2146 


10 


BC035324 


BC035324 Mus muscu 


15 


537. 8 


24 


.7 


2205 


10 


BC027356 


BC027356 Mus muscu 


1.6 


532.2 


24 


.5 


2036 


5 


BC046578 


BC046578 Xenopus I. 


17 


527 


24 


.2 


2076 


5 


BC047965 


BC047965 Xenopus 1 


18 


492. 6 


22 


. 6 


2205 


3 


AK114863 


AK114863 Ciona int 


19 


4 62.2 


21 


.3 


1896 


3 


AY089547 


AY089547 Drosophil 


20 


457. 4 


21 


. 0 


604 


6 


AR164422 


AR164422 Sequence 


21 


447 . 6 


20 


. 6 


42696 


2 


AC012813 


AC012813 Drosophil 


22 


4 4 7.6 


20 


.6 


159970 


3 


AC023702 


AC023702 Drosophil 


23 


447.6 


20 


. 6 


161360 


3 


AC105292 


AC105292 Drosophil 


24 


447.6 


20 


. 6 


289621 


3 


AE003447 


AE003447 Drosophil 


25 


439 


20 


.2 


2480 


8 


SCKRS1 


X56259 S.cerevisia 


26 


437. 4 


20 


. 1 


2708 


8 


SCYDR037W 


Z74333 S.cerevisia 


27 


4 37 . 4 


20 


. 1 


5180 


8 


YSCKRS1A 


J04186 Yeast (S.ce 


28 


437. 4 


20 


. 1 


24628 


8 


SC9673 


Z681 96 S . cerevisia 


29 


435.8 


20 


.0 


1776 


6 


AX596068 


AX596068 Sequence 


30 


425. 4 


19 


. 6 


32618 


8 


SPBC17G9 


AL109846 S.pombe c 


31 


423. 8 


19 


. 5 


107448 


2 


DMBR4 1K5 


AL121843 Drosophil 


32 


402. 8 


18 


.5 


1991 


6 


AR071755 


AR071755 Sequence 


33 


400.8 


18 


.4 


1785 


6 


AX488967 


AX488967 Sequence 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd 



OM nucleic - nucleic search, using sw model 



Run on: 



October 23, 2003, 16:54:42 ; Search time 563 Seconds 

(without alignments) 
10428.557 Million cell updates/sec 



Title: 

Perfect score 
Sequence : 

Scoring table 



Searched : 



US-09-864-464-9 

2175 

1 acttgagoctccaccttctc 



aattggtgct t ttgatagee 2175 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



2552756 seqs, 1349719017 residues 



Tota.1 number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



5105512 



Post -processing: 



Minimum Match 0% 
Maximum Match 100-6 
Listing first 45 summaries 



Database : 



N Geneseq 19Jun03: 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



/S IDS 1/gcgdat a /geneseq/ gene seqn- 
/SIDS 1/gcgdata/ genes eq/geneseqn- 
/ SIDS 1/ gcgdat a /gene seq/geneseqn- 
/S1DS1 / gcgda t a /geneseq/ genes eqn- 
/S IDS 1/gcgdat a /gene seq/ gene seqn- 
/S I DS 1/gcgdat a /geneseq/ geneseqn- 
/S I DS1 / g cgdat a /gene seq /genes eqn- 
/SIDS1 /gcgdat a/genes eq/geneseqn- 
/SI DS 1/gcgdat a/ gene seq/ geneseqn- 
/S IDS 1/gcgdata /gene seq/ gene seqn 
/SI DS 1/ gcgda ta/geneseq/geneseqn 
/SI DS1 /gcgdat a /geneseq/ gene seqn 
/SIDS1 / gcgdat a /geneseq /gene seqn 
/SIDS 1/gcgdat a/gene seq/ gene seqn 
/SI DS1 /gcgda ta /geneseq /gene seqn 
/SIDS 1/gcgdat a /gene seq /genes eqn 
/S I DS1 /gcgdat a /geneseq/ gene seqn 
/SI DS 1/ gcgda ta/geneseq/geneseqn 
/SI DS1 / gcgdat a /gene seq /gene seqn 
/SI DS1 / gcgdat a /gone seq/ gene seqn 
/SI DS1 /gcgda ta/geneseq/geneseqn 
/SIDS 1 /gcgda ta/geneseq/geneseqn 
/SIDS 1/gcgdat a/ gene seq /gene seqn 
/SIDS 1/gcgdata /geneseq/ gene seqn 
/SIDS 1 /gcgda ta/geneseq/geneseqn 



embl /NA1 98 0 . DAT 
embl/NA1981 . DAT 
embl/NA1982 . DAT 
embl /NA1 98 3 . DAT 
embl /NA1984 . DAT 
embl /NAT 985 . DAT 
embl/NAl 98 6 . DAT 
embl/NA1987 . DAT 
embl/NA1988 . DAT 
-embl/NA1989. DAT 
-embl/NAl 990 . DAT 
-embl/NAl 991 . DAT 
-embl/NAl 992 . DAT 
-embl /NA1 993 . DAT 
-embl/NA1994 . DAT 
-embl/NA1995 . DAT 
-embl/NAl 9 9 6. DAT 
-embl/NA1997 . DAT 
-embl/NA1998 . DAT 
-embl /NA1 999 . DAT 
-embl/NA2000 . DAT 
-embl /NA200 1A . DAT 
-embl /NA2001B . DAT 
-embl/NA2002. DAT: 
- emb 1 / NA2 003. DAT : 



* 

* 
★ 



* 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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cDNA corresponding 
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7 
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Human cDNA encodin 
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2169 


25 
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21 . 
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4 4 7.6 
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23 
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20 


AAX / 6 3 3 2 


Pneumocystis carin 


1 3 


4 00 . o 


18. 


4 


i "7 n c 

17 8 5 


2 4 


ABZ31 98 0 


Candida albicans e 


14 


367 . 4 


16. 


9 


1818 


25 


ABT19056 


Aspergillus fumiga 


15 


3 67 . 4 


16. 


9 


1818 


25 


ABT20876 


Aspergillus fumiga 


16 


354 . 4 


16. 


3 


1903 


25 


ABT184 62 


Aspergillus fumiga 


17 


354 . 4 


16. 


3 


1903 


25 


ABT20278 


Aspergillus fumiga 


18 


354.4 


16. 


3 


3903 


25 


ABT17868 


Aspergillus fumiga 


19 


354.4 


16. 
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3903 


25 


ABT1 9682 


Aspergillus fumiga 


pi 
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34 8 


16. 
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O T O Pi 

2128 


20 


AAX7 637 3 


P. carinii lysy.L-t 


21 


338 . 8 
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6 
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AAF87085 


Wheat lysyl-RNA sy 


22 


264 


12. 


1 


1487 


23 


AAS51633 


Staphylococcus aur 


23 


2 63.8 


12. 
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1521 


24 


ABN91 00 / 


Staphylococcus epi 


24 


z bz . D 


11 . 


6 


1 bO 3 


24 


ABN6 65 8 0 


Streptococcus poly 


2b 


2 4 9.2 


11. 


5 


14 8 8 


2 4 


ABN7 0907 


Streptococcus poly 


2 D 


2 4 9.2 


11. 
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1 P A Pi 

1 b 9 9 


2 4 


ABN6657 9 


Streptococcus poly 


27 


2 4 9.2 


11 . 
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21 55o 61 


24 


ABN7 1527 


Streptococcus poly 


28 


244.4 


11 . 


2 


14 97 


23 


AAS52822 


Enterococcus faeca 


29 


244.4 


11 . 


2 


150 0 


23 


AAS51370 


Enterococcus faeca 


*"i p\ 

30 


o i pi 
2 39 


11 . 


0 


2944528 


24 


ABA0304 1 
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Listeria monocytog 


31 


2 38.6 
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1518 


23 


AAS52719 


E. coli DNA for ce 


32 


237.6 


10. 


9 


6617 


22 


AAI97964 


Lawsonia intracell 


33 


235.8 


10. 


8 


2365589 


24 


ABA90521 


Genomic sequence o 


34 


234 . 6 


10. 


8 


4066 


22 


AAH54963 


S. epidermidis gen 


35 


232 . 8 


10. 


7 


45613 


22 


AAF28535 


Genomic fragment # 


36 


229.8 


10 . 


6 


1163020 


24 


ABQ67197 


Listeria innocua c 


37 


229.8 


10. 


6 


3011208 


24 


ABQ69245 


Listeria innocua D 


38 


227 . 8 


10. 


5 


1473 


25 


ABX06381 


S. pneumoniae type 


39 


227.8 


10. 


5 


1491 


23 


AAS55666 


Streptococcus pneu 


40 


227 . 8 


10. 


5 


3232 


19 


AAV52277 


Streptococcus pneu 


41 


227 . 8 


10 . 


5 


2162598 


25 


ABS56454 


Streptococcus pneu 


42 


226.2 


10. 


4 


1491 


18 


AAT92930 


Streptococcus pneu 


43 


226.2 


10. 


4 


1491 


22 


AAH42416 


Nucleotide sequenc 


44 


224 . 6 


10. 


3 


580073 


18 


AAT58840 


Mycoplasma genital 


4 5 


218 


10. 


0 


263 


25 


A3X84849 


Corn ear-derived p 



ALIGNMENTS 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Corapugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



October 23, 2003, 18:21:52 ; Search time 152 Seconds 

(without alignments) 
6315.844 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-864-464-9 
2175 

1 acttgagcctccaccttctc 



aattggtgcttttqatagcc 2175 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 569978 seqs, 220691566 residues 

Total, number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB scq lenqth: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1139956 



Database : 



1 

2 
3 
4 
5 
6 



Issued_Patents_NA: * 

/cgn2_6/ptodata/2/ina/5A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/5B COMB. seq: * 
/cgn2_6/ptodata/2/ina/6A_COMB. seq: * 
/cgn2_6/ptodata/2/ina/6B_COMB. seq: * 
/cgn2_6/ptodata/2/ina/PCTUS_COMB. seq: 
/cgn2_6 /ptodat a/2 /ina/backf lies 1 . seq : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 


1 


2175 


100.0 


2175 


3 


US- 


09- 


357 


-251-9 


Sequence 


9, Appli 


2 


886.4 


40.8 


2143 


3 


US- 


09- 


357 


-251-13 


Sequence 


13, Appl 


3 


457 . 4 


21 . 0 


604 


3 


US- 


09- 


357 


-251-11 


Sequence 


11, Appl 


4 


402.8 


18.5 


1991 


2 


US- 


08- 


415 


-593-40 


Sequence 


40, Appl 


5 


348 


16.0 


2128 


2 


US- 


08- 


415 


-593-39 


Sequence 


39, Appl 


6 


338 . 8 


15.6 


702 


3 


US- 


09- 


357 


-251-15 


Sequence 


15, Appl 


7 


263.8 


12. 1 


1521 


A 


US- 


09- 


134 


-001C-470 


Sequence 


470, App 


8 


231.8 


10.7 


1566 


4 


US- 


09- 


328 


-352-1692 


Sequence 


1692, Ap 


9 


231 . 6 


10.6 


1290 


4 


us- 


09- 


107 


-532A-3187 


Sequence 


3187, Ap 


10 


227.8 


10.5 


3232 


4 


US- 


08- 


961 


-527-144 


Sequence 


14 4, App 


11 


226.2 


10.4 


1491 


3 


US- 


08- 


833 


-876-1 


Sequence 


1, Appli 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd . 



OM nucleic - nucleic search, using sw model 



Run on: 



October 23, 2003, 20:48:08 ; Search time 575 Seconds 

(without alignments) 
10144.203 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-864-464-9 
2175 

1 act t gagcctccacct tctc 



aattggtgct tttgatagee 2175 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Sea rched : 



1792395 seqs, 1340900451 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



3584790 



Database : 



Publ ished 



1 

2 
3 
4 
5 
6 
7 

8 
9 
10 
11 
12 
13 
14 
15 
16 
17 



/cgn2 _ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2_ 
/cgn2 
/cgn2_ 
/cgn2 
/cgn2 
/cgn2 
/cgn2 
/cgn2 
/cgn2 
/cgn2 
/cgn2 



_Applications_NA: * 

6/ptodata/l/pubpna/US07_PUBCOMB. seq: * 
6/ptodata/l/pubpna/PCT_NEW_PUB. seq: * 
6/ptodata/l/pubpna/US0 6 _NEW_PUB. seq: * 
6/ptodata/l/pubpna/US06^PUBCOMB. seq: * 
6/ptodata/l/pubpna/US07_NEW_PUB. seq: * 
6/ptodata/l/pubpna/PCTUS_PUBCOMB. seq: 
6/ptodata/l/pubpna/US08_NEW_PUB. seq: * 
6/ptodata/l/pubpna/US08__PUBCOMB. seq: * 
6/ptodata/l/pubpna/US09A_PUBCOMB. seq: 
_6/ptodata/l/pubpna/US09B_PUBCOMB. seq 
_6/ptodata/l/pubpna/US09C_PUBCOMB. seq 
_6/ptodata/l/pubpna/US09_NEW_PUB. seq: 
_6/ptodata/l/pubpna/US10A_PUBCOMB. seq 
_6/ptodata/l/pubpna/US10B_PUBCOMB. seq 
_6/ptodata/l/pubpna/US10_NEW__PUB. seq: 
_6/ptodata/l/pubpna/US60_NEW^PUB. seq: 
6/ptodata/l/pubpna/US60 PUBCOMB. seq: 



. * 

• 

* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Resu I t 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


1 


535 . 8 


24 . 


6 


1639 


14 


US-10-043-487-104 


2 


527 . 8 


24 . 


3 


2169 


13 


US-10-044-090-347 


3 


400. 8 


18. 


4 


1785 


12 


US-1.0-032-585-6267 


4 


367 . 4 


16. 


9 


1818 


14 


US- 10- 128 -7 1.4 -22 2 6 


5 


367 . 4 


16. 


9 


1818 


14 


US-10-128-714-7226 


6 


354 . 4 


16. 


3 


1903 


14 


US-10-128-714-1226 


7 


354 . 4 


16. 


3 


1903 


14 


US-1 0-128-7 14- 622 6 


8 


354 . 4 


16. 


3 


3903 


14 


US-10-128-714-226 


9 


354 . 4 


16. 


3 


3903 


14 


US-10-128-71 4-5226 


10 


264 


12. 


1 


1487 


9 


US-09-815-242-4215 


11 


244 . 4 


11 . 


2 


1497 


9 


US-09-815-242-6459 


12 


244.4 


11 . 


2 


1500 


9 


US-0 9-8 15-2 4 2-3 952 


13 


238 . 6 


11. 


0 


1518 


9 


US -0 9- 81 5-2 42-6356 


14 


237 . 6 


10. 


9 


6617 


14 


US-10-210-296-1 


15 


232 . 4 


10. 


7 


234 


9 


US-09-923-876-18 


16 


227 . 8 


10. 


5 


1491 


9 


US-0 9- 8 15-2 4 2- 9303 


17 


224 . 6 


10. 


3 


5800 /3 


12 


US-10-205-220-1 


18 


224 


10. 


3 


38459 


12 


US-09-960-858-3 


19 


224 


10. 


3 


38459 


1 2 


US-09-960-870-3 


20 


216.6 


10. 


0 


1506 


9 


US-09-815-242-7203 


21 


215. 6 


9. 


9 


565 


14 


US-1 0-066-5 4 3-2 312 


22 


214.2 


9. 


8 


873 


9 


US-09-815-242-8618 


c 23 


212 


9. 


7 


1830121 


14 


US-10-329-960-1 


24 


210.2 


9. 


7 


1509 


9 


US-09-815-242-7108 


25 


205. 8 


9. 


5 


484 


11 


US-09-918-9 95-2552 5 


26 


204 . 6 


9. 


4 


1518 


9 


US-09-741-669-159 


27 


204 . 6 


9. 


4 


1518 


9 


US-09-815-242-6188 


28 


198 . 8 


9. 


1 


554 


10 


US-09-796-692-5756 


29 


198 . 8 


9. 


1 


554 


14 


US-10-040-862-5756 


c 30 


198 . 4 


9. 


1 


680 


9 


US-09-835-992A-39 


31 


192 . 2 


8 . 


8 


640681 


10 


US-09-790-988-1 


32 


191.4 


8 . 


8 


987 


9 


US-09-815-242-9774 


c 33 


179.2 


8 . 


2 


537 


14 


US-10-066-543-163 


34 


169 


7 . 


8 


975 


10 


US-09-974-300-1 181 


35 


167. 4 


7 . 


7 


347 


10 


US-09-920-300A-1236 


36 


167.4 


7 . 


7 


347 


12 


US-10-099-926-1236 


37 


167.4 


7 . 


7 


347 


13 


US-10-033-528-1236 


c 38 


167.4 


7 . 


7 


873 


8 


US-08-781-986A-196 


39 


152.2 


7 . 


0 


1506 


9 


US-09-815-242-7845 


40 


147.4 


6. 


8 


432 


11 


US-09-918-995-16229 


41 


144 . 8 


6. 


7 


648 


10 


US-09-974-300-5592 


42 


144 . 6 


6. 


6 


497 


11 


US-09-918-995-20671 


43 


128 


5. 


9 


128 


9 


US-09-294-093B-1888 


44 


125.8 


5. 


8 


1518 


10 


US-09-712-363-129 


45 


117.2 


5. 


4 


420 


9 


US-09-815-242-8050 



Description 



Sequence 104, App 
Sequence 347, App 
Sequence 6267, Ap 
Sequence 2226, Ap 
Sequence 7226, Ap 
Sequence 1226, Ap 
Sequence 6226, Ap 
Sequence 226, App 
Sequence 522 6, Ap 
Sequence 4215, Ap 
Sequence 6459, Ap 
Sequence 3952, Ap 
Sequence 6356, Ap 

Sequence 1, Appli 
Sequence 18, Appl 
Sequence 9303, Ap 
Sequence 1, Appli 
Sequence 3 , Appli 
Sequence 3 , Appl i 
Sequence 7 2 03, Ap 

Sequence 2312, Ap 
Sequence 8618, Ap 

Sequence 1, Appli 
Sequence 7108, Ap 

Sequence 25525, A 
Sequence 159, App 
Sequence 6188, Ap 
Sequence 57 56, Ap 
Sequence 57 5 6, Ap 
Sequence 39, Appl 

Sequence 1 , Appl i 
Sequence 977 4 , Ap 
Sequence 163, App 
Sequence 1181, Ap 
Sequence 1236, Ap 
Sequence 1236, Ap 
Sequence 1236, Ap 
Sequence 196, App 
Sequence 7845, Ap 
Sequence 16229, A 
Sequence 5592, Ap 
Sequence 20671, A 
Sequence 1888, Ap 

Sequence 12 9, App 
Sequence 80 50, Ap 



ALIGNMENTS 



RESULT 1 

US-10-043-487-104 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - nucleic search, using frame _ plus_p2n model 



Run on : 



October 23, 2003, 21:56:34 ; Search time 409 Seconds 

(without alignments) 
4184.459 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 



US-09-864-464-10 
3331 

1 LSLHLLRVSPSSPFSFRSPL 



LTDSQNIKEVLLFPAMKPQE 634 



Scoring table: 

Searched : 
Total number c 



BLOSUM62 
Xgapop 10.0 , 
Ygapop 10.0 , 
Fgapop 6.0 , 
Delop 6.0 , 

2552756 seqs, 

hits satisfy! 



Xgapext 0 . 5 

Ygapext 0 . 5 

Fgapext 7 . 0 

Delext 7.0 

1349719017 residues 

g chosen parameters: 5105512 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 4 5 summaries 

Command line parameters: 
-MODEL-f rame+j?2n. model -DEV=xlp 

Q=/cgn2_l/USPTO_spool_p/US098 64 4 64/runat_2 2102003_171604_3925/app_query . fast 
775 

-DB=N Gcneseq_19Jun03 -QFMT=fastap -SUFFIX=p2n . rng -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST-45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -AL1GN=15 
-MODE=LOCAL -OUTFMT=pto -NORM--cxt -HEAPS I ZE- 500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US09864464_@CGN_l_l_490_@runat 22 1 02003__17 1 604_3925 -NCPU=6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG__SCORES = 0 -WATT -OS PBLOCK=l 00 -LONGLOG 
-DEV_TIMEOUT = 120 -WARN_T IMEOUT=30 -THREADS -1 -XGAPOP=10 -XGAPEXT-0 . 5 -FGAPOP 
-FGAPEXT = 7 -YGAPOP=10 -YGAPEXT -0.5 -DELOP=6 -DELEXT=7 



Database : N_Geneseq_l 9 Jun03 : * 

1 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1980 . DAT: * 

2 : /SIDSl/gcgdata/geneseq/ger.cscqn-embl/NA1981 . DAT: * 

3 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1982 . DAT: * 

4 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 3 . DAT: * 

5: /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1984 . DAT: * 

6: /SIDSl/gcgdata/goneseq/geneseqn-embl/NA198 5 . DAT: * 

7 : /SIDSl/gcgdata/gcneseq/gcneseqn-embl/NA198 6 . DAT: * 

8 : /SIDSl/gcgdata/geneseq/geneseqn-embl/NA1987 . DAT: * 

9: /SIDSl/gcgdata/geneseq/geneseqn-embl/NA198 8 . DAT: * 

10 : /SIDS1 /gcgdata/geneseq/geneseqn-embl/NA198 9 . DAT : * 



11 

12 
13 
14 
15 
16 
17 
18 
1 9 
20 
21 
22 
23 
24 
2b 



/SIDS 1 /gcgdata /genes eq/geneseqn- 
/SIDS1 /gcgdata/geneseq/ gene s eqn - 
/SIDS1 /gcgda t a/gene seq /genes eqn- 
/SIDS1 /gcgda ta/geneseq/geneseqn- 
/S IDS 1 /gcgda t a /gene seq/ geneseqn- 
/S I DS1 /gcgda t a/gene seq /genes eqn- 
/SIDS1 /gcgdata /geneseq /genes eqn- 
/STDS1 /gcgdata/geneseq/ geneseqn- 
/S I DS1 /gcgdata /gene seq /genes eqn- 
/SIDS1 /gcgda ta /gen es eq/geneseqn • 
/S I DS1 /gcgdata /gene seq/ geneseqn- 
/SIDS1 /gcgdata /gene seq /genes eqn- 
/S I DS1 /gcgdata /qeneseq/geneseqn- 
/S I DS1 /gcgdata /gene seq /genes eqn' 
/S I DS1 /gcgdata /geneseq /genes eqn 



embl/NA1990. DAT 
embl/NA1991 . DAT 
embl/NA1992 . DAT 
•embl /NA1 993 . DAT 
embl /NA1 994 . DAT 
embl/NA1995 . DAT 
embl /NA1 9 9 6. DAT 
•embl /NA1 997 . DAT 
•embl/NA1998 . DAT 
-embl /NA1 999. DAT 
•embl /NA2 000 . DAT 
•embl/NA2001A. DAT: 
•embl /NA2001B . DAT : 
•embl/NA2002 . DAT : * 
-embl/NA200 3 . DAT : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



o . 


Score 


Match 


Length DB 


ID 


De scr i pt l on 


1 


3331 


100. 


0 


2175 


22 


AAF87082 


Corn lysyl-RNA syn 


2 


2336 


70. 


1 


2143 


22 


AAF87084 


Soybean lysyl-RNA 


3 


2287 . 5 


68 . 


7 


1972 


21 


AAC51249 


Arabidopsis thaLia 


4 


1778 


53. 


4 


1997 


22 


AAH76461 


cDNA corresponding 


5 


1778 


53. 


4 


1997 


24 


ABK83648 


Human cDNA differe 


6 


1752 . 5 


52. 


6 


2015 


23 


ABL04 511 


Drosophila melanog 


7 


1742 


52 . 


3 


2169 


25 


ABX63347 


Human cDNA #34 7 di 


8 


1719.5 


51 . 


6 


1639 


24 


ABS51524 


Human cDNA encodin 


9 


1674 .5 


50. 


3 


4361 


23 


ABL04510 


Drosophila melanog 


10 


1655 


49. 


7 


1271 


21 


AAC38991 


Arabidopsis thalia 


11 


1574 


4 7 . 


3 


1785 


24 


ABZ31980 


Candida albicans e 


12 


1571 


47 . 


2 


1991 


20 


AAX76332 


Pneumocystis carin 


13 


1542.5 


46. 


3 


1818 


25 


ABT19056 


Aspergillus fumiga 


14 


1542.5 


46. 


3 


1818 


25 


ABT20876 


Aspergillus fumiga 


lb 


1527 


45. 


8 


2128 


20 


AAX76373 


P. carinii lysyl-t 


16 


1510.5 


45. 


3 


1903 


25 


ABT18462 


Aspergillus fumiga 


17 


1510.5 


45. 


3 


1903 


2b 


ABT20278 


Aspergillus fumiga 


18 


1510.5 


45. 


3 


3903 


25 


ABT17868 


Aspergillus fumiga 


19 


1510.5 


45 . 


3 


3903 


25 


ABT19682 


Aspergillus fumiga 


20 


986 


29. 


6 


1163020 


24 


ABQ67197 


Listeria innocua c 


21 


986 


29. 


6 


3011208 


24 


ABQ69245 


Listeria innocua D 


22 


983 


29. 


5 


2944528 


24 


ABA03041 


Listeria monocytog 


23 


976.5 


29. 


3 


1521 


24 


ABN91007 


Staphylococcus epi 


24 


957 


28 . 


7 


1487 


23 


AAS51633 


Staphylococcus aur 


25 


956.5 


28 . 


7 


2365589 


24 


ABA90521 


Genomic sequence o 


26 


947 


28 . 


4 


1506 


23 


AAS53566 


Helicobacter pylor 


27 


946. 5 


28 . 


4 


1692 


19 


AAV53138 


Helicobacter lysyl 


28 


939 


28 . 


2 


3232 


19 


AAV52277 


Streptococcus pneu 


29 


939 


28 . 


2 


2162598 


25 


ABS56454 


Streptococcus pneu 


30 


937. 5 


28 . 


1 


1473 


25 


ABX06381 


S. pneumoniae type 


31 


937.5 


28 . 


1 


1491 


18 


AAT92930 


Streptococcus pneu 


32 


937.5 


28 . 


1 


1491 


22 


AAH42416 


Nucleotide sequenc 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Cornpuger: Ltd. 



OM protein - nucleic search, using f rame__plus_jp2n model 



Run on : 



October 23, 2003, 23:47:49 ; Search time 112 Seconds 

(without alignments) 
2498.544 Million cell updates/sec 



Ti tie : 

Perfect score: 
Sequence : 

Scoring table: 



US-09-864-464-10 
3331 

i lslhllrvspsspfsfrsp: 



LTDSQNIKEVLLFPAMKPQE 634 



BLOSUM62 
Xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Deiop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0 . 5 
0 . 5 

7.0 
7 . 0 



Searched : 



569978 seqs, 220691566 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1139956 



c 



Post-processing: Minimum Match 0 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 

-MODEL-- f rame+_p2n . model -DEV=xlp 

Q--/cgn2 1 /USPTO_spool_p/US098 64 4 64 /runat_22 102 003_171 60 5_3965/app query . f asta_l 
775 

-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=p2n . rni -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS-bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_ SCORE=pct -THR_KAX=100 -THR_MIN-0 -AL1GN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPS1ZE=500 -MTNLEN-0 -MAXLEN=2000000000 
-USER=US09864464 _@CGN_l_l_103_@runat_22102003 _171605_3965 -NCPU=6 -TCPU-3 
-NOJ4MAP -LARGEQUERY -NEG_SCORES 0 -WAIT -DSPBLOCK- = 100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_T1MEOUT=30 -THREADS- 1 -XGAPOP=10 -XGAPEXT-0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database : 



Issued_Patents_NA; * 

1 : /cgn2_6/ptodata/2/ina/5A_COMB. seq: * 

2 : /cgn2_6/ptodata/2/ina/5B_COMB. seq: * 

3 : /cgn2_6/ptodata/2/ina/6A_COMB. seq: * 

4 : /cgn2__6/ptodata/2/ina/6B_COMB. seq: * 

5 : /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq: * 
6: /cgn2 6/ptodat a/2/ina/backf lies 1 . seq : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 




1 


3331 


100. 


0 


2175 


3 


US-09-357-251-9 


Sequence 9, Appli 




2 


2336 


70. 


1 


214 3 


3 


US-09-357-251-13 


Sequence 13, Appl 




3 


1571 


47. 


2 


1991 


2 


US-08-4 15-593-40 


Sequence 40, Appl 




4 


1527 


45. 


8 


2128 


2 


US-08-415-593-39 


Sequence 39, Appl 




5 


976. 5 


29. 


3 


1521 


4 


US-09-134-001C-470 


Sequence 470, App 




6 


94 6.5 


28 . 


4 


1692 


1 


US-08-451-715A-9 


Sequence 9, Appli 




7 


939 


28 . 


2 


3232 


4 


US-08-961-527-144 


Sequence 144, App 




8 


937 . 5 


28 . 


1 


14 91 


3 


US-08-833-876-1 


Sequence 1, Appli 




9 


937 . 5 


28 . 


1 


1491 


4 


US-09-483-054-1 


Sequence I, Appli 




10 


929 


27. 


9 


604 


3 


US-09-357-251-11 


Sequence 11, Appl 




11 


926 


27 . 


8 


1566 


4 


US-09-328-352-1692 


Sequence 1692, Ap 




12 


893.5 


26. 


8 


1290 


4 


US-0 9-107-5 32A-3187 


Sequence 3187, Ap 


c 


13 


877 . 5 


26. 


3 


1770 


4 


US- 09-2 52- 9 91 A- 1522 


Sequence 1522, Ap 




1.4 


87V 


26. 


3 


1524 


4 


US-0 9-252-991A-1 651 


Sequence 1651, Ap 


c 


15 


8 60. 5 


25. 


8 


1830121 


4 


US-09-557-884-1 


Sequence 1, Appl 


c 


16 


860.5 


25. 


8 


1830121 


4 


US-09-643-990A-1 


Sequence 1 , Appl 




17 


822 


24 . 


7 


580073 


4 


US-08-545-528D-1 


Sequence 1 , Appli 




18 


785 


23. 


6 


1581 


2 


US-08-898-780A-1 


Sequence 1, Appli 




19 


785 


23. 


6 


1581 


3 


US-09-270-91 7-1 


Sequence 1, Appli 


c 


20 


753 


22. 


6 


4403765 


3 


US-09-103-840A-2 


Sequence 2, Appl 


c 


21 


753 


22 . 


6 


4411529 


3 


US-09-103-840A-1 


Sequence 1 , Appl 




22 


741.5 


22. 


3 


3072 


4 


US-09-221-017B-337 


Sequence 337, App 




23 


725 


21 . 


8 


1230025 


4 


US-09-198-452A-1 


Sequence 1, Appl 




24 


708 


21. 


3 


702 


3 


US-09-357-251-15 


Sequence 15, Appl 


c 


25 


518 


15. 


6 


680 


3 


US-08-896-1 64-39 


Sequence 39, Appl 




2 6 


447 


13. 


4 


813 


4 


US-09-252-991A-1600 


Sequence 1600, Ap 




27 


424 . 5 


12. 


7 


894 


4 


US-09-252-9 91A-15 99 


Sequence 1599, Ap 




28 


387 


11 . 


6 


263 


4 


US-09-313-2 94A-330 9 


Sequence 3309, Ap 


c 


29 


355 . 5 


10 . 


7 


537 


4 


US-09-107-532A-108 


Sequence 108, App 




30 


325 


9. 


8 


1830121 


4 


US-09-557-884-1 


Sequence 1, Appl 




31 


325 


9. 


8 


1830121 


4 


US-09-643-990A-1 


Sequence 1, Appl 




32 


323 


9. 


7 


287 


2 


(JS-08-415-593-34 


Sequence 34, Appl 


c 


33 


316 


9. 


5 


1 803 


4 


US-09-252-991A-12466 


Sequence 12466, A 




34 


316 


9 . 


5 


1971 


4 


US-09-252-9 91A-13004 


Sequence 13004, A 


c 


35 


315 


9. 


5 


1664976 


4 


US-08-916-4213-1 


Sequence 1, Appl 




36 


314 . 5 


9. 


4 


1002 


4 


US-09-328-352-3015 


Sequence 3015, Ap 




37 


314 . 5 


9. 


4 


2007 


4 


□S-08-829-4C2-1 


Sequence 1 , App. I i 




38 


310.5 


9. 


3 


1782 


4 


US-09-134-001C-755 


Sequence 755, App 




39 


276. 5 


8. 


3 


13 62 


4 


US-09-328-352-1146 


Sequence 1146, Ap 


c 


40 


270.5 


8 . 


1 


13104 


4 


US-08-961-527-34 


Sequence 34, Appl 




41 


269.5 


8. 


1 


1770 


4 


US-09-107-532A-2432 


Sequence 2432, Ap 




42 


269. 5 


8 . 


1 


1 948 


3 


US-09-352-990-1 


Sequence 1, Appli 




43 


268.5 


8 . 


1 


1764 


1 


US-08-844-280-1 


Sequence 1, Appli 




44 


268.5 


8. 


1 


1764 


3 


US-09-006-726-1 


Sequence 1, Appli 




45 


261 


7 . 


8 


392 


3 


US-08-833-876-3 


Sequence 3, Appli 
















ALIGNMENTS 





RESULT 1 
US-09-357-251-9 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - nucleic search, using frame plus p2n model 



Run on: 



October 24, 2003, 01:37:25 ; Search time 414 Seconds 

(without alignments) 
4106.912 Million cell updates/sec 



Title : 

Perfect score 
Sequence : 

Scoring table 



Searched : 



US-09-864-464-10 
3331 

1 LSLHLLRVSPSSPFSFRSPL 



LTDSQNIKEVLLFPAMKPQE 634 



BLOSUM62 
Xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Uelop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7 . 0 
7 . 0 



1792395 seqs, 1340900451 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



3584790 



Command line parameters: 
-MODEL=f rame+_p2n .model -DEV=xlp 

Q=/cgn2 l./USPTO_spool_p/US09864464/runat_22102003_171606_4046/app_query . fasta 1 
77 5 

-DB=Published_Applications_NA -QFMT-=f astap -SUFFlX=p2n . rnpb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS-bits - START-- 1 -END=-1 -MATRTX=blosum62 
-TRANS=human4 0. cdi -LIST-45 -DOCALIGN-200 -THR SCORE=pct -THR MAX-100 
-THR_MTN=0 -ALIGN-- 15 -MODE=LOCAL -OUTFMT=pto -NORM=exr, -FiEAPSI ZE-500 -MINLEN=0 
-MAXLEN=2000000000 -USER=US0 98 64 4 64_ @CGN__1_1_1 07_@ runat_221 0200 3_17 1 606_4 04 6 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK= 1 00 
-LONGLOG -DEV_TTMEOUT = 120 -WARN_TlMEO T JT--3 0 -THREADS = 1 -XGAPOP=10 -XCAPEXT=0 . 5 
-FGAPOP=6 -FGAPEXT --7 -YGAPOP-10 -YGAPEXT = 0 . 5 -DELOP=6 -DELEXT=7 



Database : 



Publ i shed 
1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



/cgn2_ 
/ cgn2 
/cgn2_ 
/ cgn2_ 
/cgn2_ 
/ cgn2_ 
/cgn2_ 
/cgn2_ 
/ cgn2_ 
/ cgn2 



Appl icat ions_NA: * 

6/ptodata/l/pubpna/US07__PUBCOMB. seq: * 
6/ptodata/l/pubpna/PCT_NEW_PUB, seq: * 
6/ptodata/l /pubpna/US0 6_NEW_PUB. seq: * 
6/ptodata/l /pubpna/US0 6_PUBCOMB. seq: * 
6/ptodata/l/pubpna/US07_NEW__PUB. seq: * 
6/ptodata/l/pubpna/PCTUS_ PUBCOMB. seq: * 
6/ptodata/l/pubpna/US08_NEW_PUB. seq: * 
6/ptodata/l/pubpna/US08_PUBCOMB. seq: * 
6/ptodata/l/pubpna/US09A_PUBCOMB. seq: * 
_6/ptodata/l/pubpna/US09B PUBCOMB. seq: * 



11 

12 
13 
14 
15 
16 
17 



/cgn2_6/puodata/l/pubpna/US0 9C_PUBCOMB. seq: 
/cgn2_ 6/ptodata/l/pubpna/US09_NEW_PUB.seq: * 
/cgn2_6/ptodata/l/pubpna/US10A PUBCOMB. seq: 
/cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB . seq: 
/cgn2_6/ptodata/l/pubpna/US10_NEW_PUB. seq: * 
/cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. seq: * 
/cgn2 6/ptodata/l/pubpna/US60 PUBC0M3 . seq : * 



Prod. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



o . 


Score 


Match 


T _ i_ 1 

Length 


DB 


I D 


1 


*1 ~7 /I O 

17 4 2 


52 . 


3 


2169 


1 3 


np 1 P\ C\ A A f\ O. C\ O A *' I 

US-10-044-090-34 / 


2 


1719.5 


51. 


6 


i ron 

1639 


i a 

1 4 


n P 1 1 P\ r\ A '3 A O "7 1 A y| 

US-l 0-043-48 / -1U4 


3 


157 4 


47 . 


3 


178 5 


i o 

1 2 


n f i pi p> o o cou f~ ^ r 'i 
US-J 0-U JZ-DOO-DZD / 


4 


1542.5 


46. 


3 


1 pi i p> 

1818 


1 4 


nn 1 Pi 1 O P> ""7 1/1 O O O /" 

US -10-128-714-2226 


5 


1 54 2 . 5 


46. 


3 


1 P> 1 p> 

1818 


1 A 

1 4 


HP 1 A ion "J 1 A ~ 7 O O /~ 

US-10-128-714 - / 226 


6 


1 P 1 Pi c 

1510.5 


45 . 


3 


1 Pi A "1 

1903 


1 A 

1 4 


nf> 1 Pi 1 O O "71/1 1 '1 O /" 

US-10-lzo - / 1 4 - 1 2 2 6 


1 


1510.5 


45. 


3 


1 f\ Pi O 

1903 


1 /I 

1 4 


no i pi i o o o o z'" 

US- 10-128-714-6226 


8 


1 f— 1 f-\ r— 

1510.5 


45. 


3 


3903 


1 4 


nn 1 Pi "1 O P) *7 1 d O ^ /" 

US -10-128-71 4-226 


Pi 

9 


i r~ i Pi r - 

1510.5 


45. 


3 


o pi r\ t 

3 90 3 


1 4 


tto in 1 o o "71>i coo/" - 

US-10-128-714-5226 


1 0 


957 


28 . 


7 


1 A O "J 

14 87 


9 


US-09-815-242-4215 


1 1 


947 


28 . 


4 


1506 


9 


T T P> Pi Pi O T T A d A o A ^ 

US -09-815-242-720 3 


1 O 

12 


p\ O * 7 r 

93 / . 5 


28 . 


1 


14 91 


p> 
9 


n p« A n a 1 r o >1 O a ^ A o 

US-0 9-815-242- 9303 


13 


pt **> **> 
933 


28 . 


0 


1 4 97 


9 


TIP' A A A 1 C O^O /~ A IZ t~\ 

US-09-815-242-6459 


1 A 

1 4 


Pi O O 

933 


28. 


0 


1 (7 A A 

1500 


9 


iip> r\ pi oic a j a a at a 

US-09-8 15-242-3952 


1 5 


925.5 


27. 


8 


/" 1 "7 

6617 


1 A 

1 4 


TIP 1 1 Pi O 1 Pi Pi /" 1 

US-10-210-z9b-l 


1 6 


p\ i "7 r* 

917.5 


27 . 


5 


1518 


9 


r i rt P\ Pi P) 1 C O^O /" "~> L f 

US-09-8 15-242-6356 


17 


906 


27 . 


2 


1518 


9 


US-09-741-669-159 


1 8 


906 


27 . 


2 


1518 


9 


US-09-81 5-242-6188 


19 


876 


26. 


3 


1506 


9 


US-09-815-242-7845 


20 


868 


26. 


1 


640681 


10 


US-09-790-988-1 


21 


860. 5 


25. 


8 


1509 


9 


US-09-815-242-7108 


22 


860. 5 


25 . 


8 


1830121 


14 


US-10-329-960-1 


23 


822 


24 . 


7 


38459 


12 


US-09-960-858-3 


24 


822 


24 . 


7 


38459 


12 


US-09-960-870-3 


25 


822 


24 . 


7 


580073 


12 


US-10-205-220-1 


26 


7 90.5 


23. 


7 


3309400 


10 


US-09-738-626-1 


27 


782 . 5 


23 . 


5 


1578 


10 


US-09-738-626-2954 


28 


751 


22 . 


5 


1518 


10 


US-09-712-363-129 


29 


734 


22 . 


0 


3279 


14 


US-10-156-761-7275 


30 


734 


22 . 


0 


9025608 


14 


US-10-156-761-1 


31 


732 


22 . 


0 


987 


9 


US-09-815-242-9774 


32 


706.5 


21 . 


2 


873 


9 


US-09-815-242-8618 


33 


693 


20 . 


8 


565 


14 


US-] 0-066-543-2312 


34 


666.5 


20. 


0 


975 


10 


US-09-974-300-1181 


35 


660 


19. 


8 


554 


10 


US-09-796-692-5756 


36 


660 


19. 


8 


554 


14 


US-10-040-862-5756 


37 


599 


18. 


0 


537 


14 


US-10-066-543-163 


38 


569 


17. 


1 


484 


11 


US-09-918-995-25525 


39 


556.5 


16. 


7 


873 


8 


US-08-781-986A-196 


40 


518 


15. 


6 


680 


9 


US-09-835-992A-39 



Descr ipt ion 



c 



c 



c 
c 



Sequence 34 7, App 
Sequence 104, App 
Sequence 6267, Ap 
Sequence 2226, Ap 
Sequence 7 22 6, Ap 
Sequence 1226, Ap 
Sequence 6226, Ap 
Sequence 226, App 
Sequence 5226, Ap 
Sequence 4 215, Ap 
Sequence 7203, Ap 
Sequence 9303, Ap 
Sequence 6459, Ap 
Sequence 3952, Ap 

Sequence 1, Appli 
Sequence 6356, Ap 
Sequence 159, App 
Sequence 6188 , Ap 
Sequence 7845, Ap 

Sequence 1, Appli 
Sequence 7108, Ap 
Sequence 1 , Appl 
Sequence 3, Appli 
Sequence 3, Appli 
Sequence 1, Appli 
Sequence 1 , Appl 
Sequence 2954 , Ap 
Sequence 129, App 
Sequence 7275, Ap 
Sequence 1, Appl 
Sequence 9774 , Ap 
Sequence 8618, Ap 
Sequence 2 312, Ap 
Sequence 1181, Ap 
Sequence 5756, Ap 
Sequence 57 56, Ap 
Sequence 163, App 
Sequence 25525, A 
Sequence 196, App 
Sequence 39, Appl 



GenCore version 5.1,6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - nucleic search, using frame plus p2n model 



Run on: 



October 23, 2003, 23:39:54 



; Search time 2884 Seconds 

(without alignments) 

5342.940 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-864-464-10 
3331 

1 LSLHLLRVSPSSPFSFRSPL 



LTDSQNIKEVLLFPAMKPQE 634 



Scoring table: 



BLOSUM62 



Xgapop 10.0 

Ygapop 10.0 

Fgapop 6 . 0 

Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Del ext 



0.5 
0.5 
7.0 
7.0 



Searched : 



22781392 seqs, 12152238056 residues 



Total number of hits satisfying chosen parameters: 



45562784 



Minimum DB seq length: 0 

Maximum DB seq length: 20000C0000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n. model -DEV=xlp 

Q=/cgn2_ l/USPTO_spool p/US098 64 4 64 /runat_22102003_171605_ 394 7/app_ query. fast a 
775 

-DB=EST -QFMT=fastap -SUFFIX=p2n . rst -MINMATCH=0 . 1 -LOOPCL-0 -LOOPEXT=0 
-UNITS=bits -START=1 -END--1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE-pct -THR_MAX-100 -THR_MIN-0 -ALIGN--15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPS! ZE=500 -MINLEN=0 -MAXLEN=2 0 000 00000 
-USER=US09864464_@CGN_1_ 1_3549_ @runat_22102003__171 605_3947 -NCPU-6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=-0 -WAIT -DS PBLOCK=l 00 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP-10 -XGAPEXT=0.5 -FGAPOP= 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : EST:* 



1: 
2 : 
3: 
4 : 
5: 
6: 
7 : 
8: 
9: 
10 



em_es tba : * 
em_esthum : * 
em e s t i n : * 
em estmu : * 
em_es tov : * 
em_es tpl : * 
em es tro : * 
em htc : * 



gb_estl : * 
gb est 2 : * 



1 1 : 


gb_ 


ht c . 




1 2 : 


gb_ 


e s 1 3 : * 




1 3 : 


gb_ 


est 4 : * 




14 : 




esto : * 




1 b : 


em 


est run : w 




1 6 : 


cm 


estom: * 




17 : 


em 


gss hum: 


•X, 


1 8 : 


em 


gss inv : 




19 : 


em 


gss pin: 




20 ' 


em 


gss vrt : 


■X 


21 


em 


gss run: 




22 


: em 


gss mam: 




23 


: em 


gss mus : 




24 


: em 


gss pro: 




25 


: cm 


gss rod: 




26 


: cm 


gss phg : 




27 


: em 


gss vrl : 




28 


: gb~ 


gss 1 : * 




29 


: gb 


gss2 : * 





Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length OR ID Description 



1 


3331 


100. 


0 


2220 


11 


AY103979 


AY103979 Zea mays 


2 


1784 


53. 


6 


1963 


11 


AK089053 


AK089053 Mus muscu 


3 


1782 


53. 


5 


1963 


11 


AK090203 


AK090203 Mus muscu 


4 


1775.5 


53. 


3 


1960 


11 


BC013246 


BC013246 Homo sapi 


5 


1331 


40. 


0 


1 7 6 


14 


CD432929 


CD432929 EL01N0302 


6 


1314 


39. 


4 


771 


14 


CD445806 


CD445806 EL01T0203 


7 


1151 . 5 


34 . 


6 


860 


14 


CB627677 


CB627677 OSlIEb02M 


8 


1138 


34. 


2 


702 


9 


AI898064 


AI898064 EST267507 


9 


1093 


32. 


8 


663 


12 


BJ549326 


BJ549326 BJ549326 


10 


1078 


32. 


4 


684 


10 


BE585724 


BE585724 Est#lSP6_ 


11 


1067 


32. 


0 


733 


13 


BQ117168 


BQ117168 EST602744 


12 


1066 


32. 


0 


882 


13 


BQ215534 


BQ215534 AGENCOURT 


13 


1053 


31 . 


6 


631 


9 


AW066125 


AW066125 687006E08 


14 


1052 


31 . 


6 


881 


13 


BU176615 


BU176615 AGENCOURT 


15 


1049 


31. 


5 


882 


12 


BI905374 


BI905374 603167411 


16 


1037 


31 . 


1 


659 


9 


AW650691 


AW650691 EST329145 


17 


1017 


30. 


5 


601 


14 


CD443593 


CD443593 EL01N0428 


1 8 


1011 


30. 


4 


932 


13 


BX327446 


BX327446 BX327446 


19 


1008 . 5 


30. 


3 


672 


9 


AW685961 


AW685961 NF036H08N 


20 


1004 . 5 


30. 


2 


1201 


9 


AL567684 


AL567684 AL567684 


21 


999 


30. 


0 


991. 


29 


CNS06YSR 


AL421441 T3 end of 


22 


998 


30. 


0 


942 


14 


CB183903 


CB183903 AGENCOURT 


23 


997,5 


29. 


9 


915 


13 


BX327445 


BX327445 BX327445 


24 


997 


29. 


9 


1201 


9 


AL581487 


AL581487 AL581487 


25 


994 


29. 


8 


632 


9 


AV833455 


AV833455 AV833455 


26 


991 


29. 


8 


650 


14 


CA741660 


CA741660 wialc.pkO 


27 


988 


29. 


7 


929 


1 4 


CB209380 


CB209380 AGENCOURT 


28 


984 


29. 


5 


919 


10 


BG682242 


BG682242 602629550 



GcnCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - nucleic search, using frame plus_p2n model 



Run on: 



October 23, 2003, 21:59:29 



; Search time 5432 Seconds 

(without alignments) 

4774.798 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 



US-09-864-464-10 
3331 

1 LSLHLLRVSPSSPFSFRSPL 



LTDSQN1KEVLLFPAMKPQE 634 



Scoring table: 



BLOSUM62 



Xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Doloxt 



O.b 
0.5 
7.0 
7.0 



Searched : 



2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 



5777422 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_p2n .model -DFV=xlp 

Q-/cqn2_l/USPTO_spool_p/US098 64 4 64/runat_22102 003_171604_3935/app query. fast a 1 . 
775 

-DB=GcnEmbl -QFMT=fastap -SUFFIX=p2n . rge -MTNMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNlTS-bits - START- 1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST-45 
-DOCAL1GN=200 -THR_SCORE=pct -THR_MAX-100 -THR_MIN-0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSI ZE=500 -M1NLEN=0 - MAX LEN=2 000000000 
-USER=US098 64 4 64_@CGN_l___l_4 958_@runat _22102003_171604_3 935 -NCPU=6 -ICPU=3 
-NO_ MMAP -LARGEQUERY -NEG_S CORES =0 -WAIT -DS PBLOCK=l 00 -LONGLOG 

-DEV_T1MEOUT---120 -WARN_TIMEOUT 30 -THREADS-1 -XGAPOP -10 -XGAPEXT-0 . 5 -FGAPOP-6 
-FGAPEXT=7 -YGAPOP-10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1 : 
2 : 
3: 
4 : 
5: 
6: 
7 : 
8: 
9: 
10 



gb_ba : * 
gb htg:* 
gb _in : * 
gb_om: * 
gb_ov : * 
gbjpat : * 
gb_ph: * 
gb_pl : * 
gb_pr : * 
gb ro : * 
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sLs : * 


12 : 
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sy : * 


13 : 


gb 


un : * 


14 : 


gb_ 


vi : * 
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ba : * 
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run:* 
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em 


hum : * 
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em 
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mu : * 
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em 


om : * 
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em 


or : * 
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em 


ov : * 


23 : 


em 


pat : * 
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em 


ph : * 


25 


: em 


pi : * 


2 6 


: em 


ro : * 


27 


: em 


s Ls : * 


28 


: em 


un : * 


29 


: em 


vi : * 


30 


: em 


htg hum : * 


31 


: em 


htg inv : * 


32 


: em 


htg other : * 


33 


: em 


htg raus : * 


34 


: em 


htg pin:* 


35 


: em 


htg rod : * 


36 


: em 


htg mam : * 


37 


: em 


htg vrt : * 


38 


: em 


sy : * 


39 


: cm 


htgo hum : * 


40 


: em 


htgo mus : * 


41 


: em 


htgo other:* 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Descri.pt ion 


1 


3331 


100. 


0 


2175 


6 


AR164421 


AR164421 Sequence 


2 


2336 


70. 


1 


2143 


6 


AR164423 


AR164423 Sequence 


3 


2287 . 5 


68 . 


7 


1938 


8 


BT002123 


BT002123 Arabidops 


4 


2287.5 


68 . 


7 


2083 


8 


AY099551 


AY099551 Arabidops 


5 


2287.5 


68. 


7 


2193 


8 


AF125574 


AF125574 Arabidops 


6 


2084 


62 . 


6 


2075 


8 


LELYSRSGN 


X94451 L.esculcntu 


7 


1854 


55. 


7 


45173 


8 


ATAC00 9918 


AC009918 Arabidops 


8 


1854 


55. 


7 


100835 


8 


ATAC016795 


AC016795 Arabidops 


9 


1790.5 


53. 


8 


2205 


] 0 


BC027356 


BC027356 Mus muscu 


10 


1789 


53. 


7 


2023 


10 


BC036289 


BC036289 Mus muscu 


11 


1781.5 


53. 


5 


1970 


10 


CLLYTRSY 


Z31711 C.longicaud 


12 


1781 . 5 


53. 


5 


1997 


9 


BC0041 32 


BC004132 Homo sapi 


13 


1780.5 


53. 


5 


2146 


10 


BC035324 


BC035324 Mus muscu 


14 


1778 


53. 


4 


1997 


6 


AX214528 


AX214528 Sequence 


15 


1778 


53. 


4 


1997 


9 


D32053 


D32053 Homo sapien 


16 


1775.5 


53. 


3 


1970 


9 


HUM0RFKG1S 


D31890 Homo sapien 
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